Audio Signal Processing for Activity Recognition of Construction Heavy Equipment
نویسندگان
چکیده
Action recognition and tracking of construction heavy equipment is the first step for benchmarking and analyzing the performance of individual machines and evaluating the productivity of a jobsite as a whole. Aside from direct observations, the current approaches for automatically recognizing and tracking various actions of construction heavy equipment includes: 1) using active sensors such as RFID tags, GPS and accelerometers or 2) computer vision-based activity analysis (processing images or videos). In this paper, we present a novel audio-based approach for activity recognition of construction heavy equipment. Construction machines often produce distinct sound patterns while performing certain activities and it is possible to extract useful information by recording and processing those audio files at construction jobsites. The proposed audiobased framework begins with recording generated sound patterns of construction equipment using commercially available audio recorders. The recorded signal is then fed into a signal enhancement algorithm to reduce background noise commonly found at construction jobsites. The modified audio signal is then converted into a time-frequency representation using the Short-Time Fourier Transform (STFT). A Support Vector Machine (SVM) is then trained to differentiate between the acoustic patterns of the various activities of each machine. The processed audio signal is then finally divided and classified into various activities using a window filtering approach and by setting proper thresholds. We implemented the presented audio-based system at several jobsites as case studies and the results illustrate the efficiency of the system in automatically recognizing various actions of construction heavy equipment. Keywords– Audio signals; Construction; Heavy equipment; Activity recognition; Short-Time Fourier Transform
منابع مشابه
A New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)
Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, much of the non-speech signals maybe classified as ‘noise’ in human communication. The process of separating conversational speech and noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet ...
متن کاملDevelopment of New Portable Equipment for Prototyping and Evaluation on Speech and Audio Signal Processing Applications
For researchers and engineers working on speech or audio signal processing, to implement and evaluate processing algorithms on specific devices such as DSP or FPGA is necessary for realizing real time signal processing applications. Furthermore, evaluation of algorithms in real world environment is very important. However, conventional evaluation boards are not portable or quality of sound inte...
متن کاملA Review of Application of Signal Processing Techniques for Fault Diagnosis of Induction Motors – Part I
Abstract - Use of efficient signal processing tools (SPTs) to extract proper indices for fault detection in induction motors (IMs) is the essential part of any fault recognition procedure. The Part1 of the two parts paper focuses on Fourier-based techniques including fast Fourier transform and short time Fourier transform. In this paper, all utilized SPTs which have been employed for fault fete...
متن کاملApplication of Signal Processing Tools for Fault Diagnosis in Induction Motors-A Review-Part II
The use of efficient signal processing tools (SPTs) to extract proper indices for the fault detection in induction motors (IMs) is the essential part of any fault recognition procedure. The 2nd part of this two-part paper is, in turn, divided into two parts. Part two covers the signal processing techniques which can be applied to non-stationary conditions. In this paper, all utilized SPTs for n...
متن کاملVoice-based Age and Gender Recognition using Training Generative Sparse Model
Abstract: Gender recognition and age detection are important problems in telephone speech processing to investigate the identity of an individual using voice characteristics. In this paper a new gender and age recognition system is introduced based on generative incoherent models learned using sparse non-negative matrix factorization and atom correction post-processing method. Similar to genera...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016